Skip to content

Adaptive (runtime, stats-based) conjunct reordering for FilterExec#22698

Draft
adriangb wants to merge 4 commits into
apache:mainfrom
pydantic:lift-selectivity-stats
Draft

Adaptive (runtime, stats-based) conjunct reordering for FilterExec#22698
adriangb wants to merge 4 commits into
apache:mainfrom
pydantic:lift-selectivity-stats

Conversation

@adriangb

@adriangb adriangb commented Jun 1, 2026

Copy link
Copy Markdown
Contributor

Which issue does this PR close?

Rationale for this change

Predicate evaluation order matters: running a selective predicate first lets it
gate the work of the predicates after it. The static cheap/expensive heuristic
(#22343) sorts conjuncts into two cost classes and stable-sorts within each, so
it does nothing to order multiple similarly-expensive predicates; and
BinaryExpr's AND short-circuit only gates on a leftmost selective conjunct.
So a conjunction of several expensive predicates whose selective member is not
written first is evaluated with every predicate scanning ~every row — and
neither mechanism fixes it.

This PR adds runtime, statistics-based conjunct reordering for FilterExec:
it measures each conjunct's selectivity and cost on the rows that actually reach
it and runs the ones that discard the most rows per unit of CPU time first.
Maximising discards-per-second is exactly minimising cost_per_row / (1 - pass_rate),
the classic optimal ordering key for independent conjuncts.

It is off by default (datafusion.execution.adaptive_filter_reordering).

What changes are included in this PR?

Split into four reviewable commits:

  1. physical-expr-common: adaptive selectivity-stats substrate — a
    policy-free SelectivityStats (online selectivity + cost with Welford
    mean/variance and confidence bounds) and a concurrent AdaptiveStatsRegistry.
    Reusable by other consumers (e.g. a future parquet-scan integration).

  2. common: config flag execution.adaptive_filter_reordering (default
    false), plus regenerated configs.md / information_schema listing.

  3. physical-plan: adaptive conjunct reordering in FilterExec — a
    stream-local evaluator that:

    • evaluates conjuncts sequentially with threshold-gated compaction (mirroring
      BinaryExpr's pre-selection) and measures each marginally;
    • ranks by mean discards-per-second and freezes as soon as the ranking is
      statistically certain
      (adjacent confidence intervals stop overlapping),
      or after a small sample cap if the conjuncts are indistinguishable;
    • on freeze, fuses the conjuncts into a single left-deep AND in the learned
      order and evaluates it as an ordinary predicate (no measurement overhead,
      inherits BinaryExpr pre-selection);
    • periodically re-thaws with exponential backoff to catch distribution
      drift, so steady-state overhead decays toward zero.

    State is stream-local; the plan, results, and EXPLAIN are unchanged.

  4. Tests — an end-to-end .slt asserting identical results/plan with the
    flag on and off.

Are these changes tested?

Yes:

  • Unit tests for the substrate (SelectivityStats, registry) and the
    FilterExec evaluator (gating correctness, certainty-freeze, re-thaw backoff,
    drift adaptation).
  • adaptive_filter.slt: results and EXPLAIN identical with the flag on/off.

Are there any user-facing changes?

One new config option, datafusion.execution.adaptive_filter_reordering
(experimental, default false). When enabled, the order in which a conjunctive
filter's predicates are evaluated may change at runtime; results are unchanged,
but observable side effects of fallible predicates could differ (predicates
containing volatile expressions are never reordered).

@github-actions github-actions Bot added documentation Improvements or additions to documentation physical-expr Changes to the physical-expr crates sqllogictest SQL Logic Tests (.slt) common Related to common crate physical-plan Changes to the physical-plan crate labels Jun 1, 2026
@adriangb

adriangb commented Jun 1, 2026

Copy link
Copy Markdown
Contributor Author

run benchmarks

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4593095184-389-pg64x 6.12.68+ #1 SMP Wed Apr 1 02:23:28 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (5e71ea4) to 85bc5ef (merge-base) diff using: clickbench_partitioned
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4593095184-391-ctjg4 6.12.68+ #1 SMP Wed Apr 1 02:23:28 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (5e71ea4) to 85bc5ef (merge-base) diff using: tpch
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4593095184-390-w2tfn 6.12.68+ #1 SMP Wed Apr 1 02:23:28 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (5e71ea4) to 85bc5ef (merge-base) diff using: tpcds
Results will be posted here when complete


File an issue against this benchmark runner

@github-actions

github-actions Bot commented Jun 1, 2026

Copy link
Copy Markdown

Thank you for opening this pull request!

Reviewer note: cargo-semver-checks reported the current version number is not SemVer-compatible with the changes in this pull request (compared against the base branch).

Details
     Cloning apache/main
    Building datafusion-common v53.1.0 (current)
       Built [  33.102s] (current)
     Parsing datafusion-common v53.1.0 (current)
      Parsed [   0.058s] (current)
    Building datafusion-common v53.1.0 (baseline)
       Built [  32.207s] (baseline)
     Parsing datafusion-common v53.1.0 (baseline)
      Parsed [   0.059s] (baseline)
    Checking datafusion-common v53.1.0 -> v53.1.0 (no change; assume patch)
     Checked [   0.645s] 223 checks: 222 pass, 1 fail, 0 warn, 30 skip

--- failure constructible_struct_adds_field: externally-constructible struct adds field ---

Description:
A pub struct constructible with a struct literal has a new pub field. Existing struct literals must be updated to include the new field.
        ref: https://doc.rust-lang.org/reference/expressions/struct-expr.html
       impl: https://github.com/obi1kenobi/cargo-semver-checks/tree/v0.48.0/src/lints/constructible_struct_adds_field.ron

Failed in:
  field ExecutionOptions.adaptive_filter_reordering in /home/runner/work/datafusion/datafusion/datafusion/common/src/config.rs:525

     Summary semver requires new major version: 1 major and 0 minor checks failed
    Finished [  68.128s] datafusion-common
    Building datafusion-physical-expr-common v53.1.0 (current)
       Built [  22.857s] (current)
     Parsing datafusion-physical-expr-common v53.1.0 (current)
      Parsed [   0.022s] (current)
    Building datafusion-physical-expr-common v53.1.0 (baseline)
       Built [  22.930s] (baseline)
     Parsing datafusion-physical-expr-common v53.1.0 (baseline)
      Parsed [   0.022s] (baseline)
    Checking datafusion-physical-expr-common v53.1.0 -> v53.1.0 (no change; assume patch)
     Checked [   0.246s] 223 checks: 223 pass, 30 skip
     Summary no semver update required
    Finished [  47.012s] datafusion-physical-expr-common
    Building datafusion-physical-plan v53.1.0 (current)
       Built [  34.291s] (current)
     Parsing datafusion-physical-plan v53.1.0 (current)
      Parsed [   0.127s] (current)
    Building datafusion-physical-plan v53.1.0 (baseline)
       Built [  34.897s] (baseline)
     Parsing datafusion-physical-plan v53.1.0 (baseline)
      Parsed [   0.128s] (baseline)
    Checking datafusion-physical-plan v53.1.0 -> v53.1.0 (no change; assume patch)
     Checked [   0.628s] 223 checks: 223 pass, 30 skip
     Summary no semver update required
    Finished [  72.336s] datafusion-physical-plan
    Building datafusion-sqllogictest v53.1.0 (current)
       Built [ 162.689s] (current)
     Parsing datafusion-sqllogictest v53.1.0 (current)
      Parsed [   0.022s] (current)
    Building datafusion-sqllogictest v53.1.0 (baseline)
       Built [ 162.354s] (baseline)
     Parsing datafusion-sqllogictest v53.1.0 (baseline)
      Parsed [   0.024s] (baseline)
    Checking datafusion-sqllogictest v53.1.0 -> v53.1.0 (no change; assume patch)
     Checked [   0.095s] 223 checks: 223 pass, 30 skip
     Summary no semver update required
    Finished [ 329.459s] datafusion-sqllogictest

@github-actions github-actions Bot added the auto detected api change Auto detected API change label Jun 1, 2026
@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark tpch_sf1.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━┓
┃ Query     ┃                           HEAD ┃         lift-selectivity-stats ┃       Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━┩
│ QQuery 1  │ 38.60 / 40.15 ±1.11 / 41.83 ms │ 38.12 / 39.19 ±1.29 / 41.17 ms │    no change │
│ QQuery 2  │ 18.25 / 18.94 ±0.67 / 20.10 ms │ 18.60 / 18.75 ±0.13 / 18.95 ms │    no change │
│ QQuery 3  │ 33.32 / 34.21 ±1.08 / 36.32 ms │ 30.89 / 32.94 ±2.44 / 37.48 ms │    no change │
│ QQuery 4  │ 17.20 / 17.65 ±0.66 / 18.95 ms │ 17.06 / 17.50 ±0.44 / 18.33 ms │    no change │
│ QQuery 5  │ 38.04 / 40.19 ±1.13 / 41.33 ms │ 40.74 / 42.20 ±2.45 / 47.08 ms │ 1.05x slower │
│ QQuery 6  │ 16.08 / 16.16 ±0.07 / 16.28 ms │ 16.09 / 16.73 ±1.11 / 18.93 ms │    no change │
│ QQuery 7  │ 42.55 / 44.68 ±2.22 / 47.81 ms │ 45.16 / 46.48 ±1.73 / 49.80 ms │    no change │
│ QQuery 8  │ 42.44 / 43.33 ±0.89 / 44.53 ms │ 42.49 / 42.82 ±0.28 / 43.32 ms │    no change │
│ QQuery 9  │ 48.18 / 49.13 ±0.98 / 50.46 ms │ 49.06 / 49.77 ±0.51 / 50.47 ms │    no change │
│ QQuery 10 │ 41.65 / 41.84 ±0.21 / 42.23 ms │ 41.93 / 42.78 ±1.37 / 45.51 ms │    no change │
│ QQuery 11 │ 13.18 / 13.51 ±0.33 / 14.08 ms │ 13.02 / 13.32 ±0.28 / 13.79 ms │    no change │
│ QQuery 12 │ 23.50 / 24.05 ±0.34 / 24.37 ms │ 23.81 / 24.04 ±0.21 / 24.40 ms │    no change │
│ QQuery 13 │ 31.80 / 34.24 ±2.04 / 37.51 ms │ 32.99 / 34.81 ±1.63 / 37.50 ms │    no change │
│ QQuery 14 │ 23.34 / 23.65 ±0.22 / 23.92 ms │ 23.54 / 23.73 ±0.12 / 23.88 ms │    no change │
│ QQuery 15 │ 32.38 / 33.30 ±0.90 / 35.00 ms │ 31.43 / 31.78 ±0.44 / 32.51 ms │    no change │
│ QQuery 16 │ 13.98 / 14.07 ±0.09 / 14.24 ms │ 13.83 / 14.03 ±0.16 / 14.31 ms │    no change │
│ QQuery 17 │ 72.51 / 73.77 ±1.24 / 76.01 ms │ 73.86 / 75.71 ±1.90 / 79.27 ms │    no change │
│ QQuery 18 │ 57.89 / 60.26 ±2.14 / 64.27 ms │ 59.46 / 59.99 ±0.28 / 60.24 ms │    no change │
│ QQuery 19 │ 32.56 / 32.89 ±0.38 / 33.61 ms │ 32.71 / 33.12 ±0.54 / 34.17 ms │    no change │
│ QQuery 20 │ 31.90 / 32.26 ±0.32 / 32.79 ms │ 31.92 / 32.21 ±0.24 / 32.51 ms │    no change │
│ QQuery 21 │ 55.45 / 56.90 ±1.26 / 58.57 ms │ 54.49 / 56.90 ±1.29 / 58.32 ms │    no change │
│ QQuery 22 │ 13.94 / 14.24 ±0.35 / 14.83 ms │ 13.59 / 14.17 ±0.46 / 14.93 ms │    no change │
└───────────┴────────────────────────────────┴────────────────────────────────┴──────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━┓
┃ Benchmark Summary                     ┃          ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 759.43ms │
│ Total Time (lift-selectivity-stats)   │ 762.96ms │
│ Average Time (HEAD)                   │  34.52ms │
│ Average Time (lift-selectivity-stats) │  34.68ms │
│ Queries Faster                        │        0 │
│ Queries Slower                        │        1 │
│ Queries with No Change                │       21 │
│ Queries with Failure                  │        0 │
└───────────────────────────────────────┴──────────┘

Resource Usage

tpch — base (merge-base)

Metric Value
Wall time 5.0s
Peak memory 5.7 GiB
Avg memory 4.9 GiB
CPU user 29.6s
CPU sys 2.2s
Peak spill 0 B

tpch — branch

Metric Value
Wall time 5.0s
Peak memory 5.7 GiB
Avg memory 4.9 GiB
CPU user 29.9s
CPU sys 2.2s
Peak spill 0 B

File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark tpcds_sf1.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                                  HEAD ┃                lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1  │           5.90 / 6.41 ±0.87 / 8.16 ms │           5.90 / 6.37 ±0.88 / 8.12 ms │     no change │
│ QQuery 2  │        79.54 / 79.77 ±0.16 / 79.99 ms │        81.11 / 81.52 ±0.25 / 81.79 ms │     no change │
│ QQuery 3  │        28.45 / 28.95 ±0.29 / 29.31 ms │        28.91 / 29.32 ±0.30 / 29.68 ms │     no change │
│ QQuery 4  │     483.36 / 487.62 ±3.22 / 492.72 ms │     487.01 / 490.85 ±3.13 / 496.59 ms │     no change │
│ QQuery 5  │        51.50 / 52.40 ±1.04 / 54.27 ms │        52.25 / 52.73 ±0.53 / 53.72 ms │     no change │
│ QQuery 6  │        36.14 / 36.44 ±0.29 / 36.86 ms │        36.54 / 36.83 ±0.23 / 37.24 ms │     no change │
│ QQuery 7  │        95.37 / 96.57 ±0.77 / 97.70 ms │        95.91 / 96.45 ±0.29 / 96.75 ms │     no change │
│ QQuery 8  │        36.31 / 38.71 ±3.23 / 45.06 ms │        37.82 / 39.53 ±2.72 / 44.95 ms │     no change │
│ QQuery 9  │        54.00 / 54.69 ±0.54 / 55.52 ms │        53.60 / 55.54 ±2.05 / 59.20 ms │     no change │
│ QQuery 10 │        68.71 / 68.99 ±0.33 / 69.61 ms │        68.69 / 68.91 ±0.16 / 69.09 ms │     no change │
│ QQuery 11 │     300.63 / 303.06 ±3.10 / 309.17 ms │     295.87 / 299.86 ±2.69 / 304.15 ms │     no change │
│ QQuery 12 │        28.43 / 28.81 ±0.28 / 29.24 ms │        28.53 / 28.87 ±0.26 / 29.19 ms │     no change │
│ QQuery 13 │     118.99 / 119.74 ±0.64 / 120.56 ms │     119.54 / 120.06 ±0.32 / 120.40 ms │     no change │
│ QQuery 14 │     505.84 / 510.44 ±4.11 / 516.34 ms │    505.29 / 511.56 ±10.27 / 531.95 ms │     no change │
│ QQuery 15 │        57.96 / 59.10 ±1.12 / 61.21 ms │        58.92 / 60.13 ±0.79 / 61.03 ms │     no change │
│ QQuery 16 │           6.79 / 6.95 ±0.14 / 7.19 ms │           7.01 / 7.12 ±0.16 / 7.43 ms │     no change │
│ QQuery 17 │        81.13 / 82.48 ±1.32 / 84.77 ms │        80.81 / 82.01 ±0.97 / 83.70 ms │     no change │
│ QQuery 18 │     123.87 / 124.79 ±0.88 / 126.34 ms │     124.82 / 125.69 ±0.58 / 126.33 ms │     no change │
│ QQuery 19 │        41.80 / 42.60 ±0.64 / 43.71 ms │        42.14 / 42.69 ±0.47 / 43.47 ms │     no change │
│ QQuery 20 │        35.40 / 35.82 ±0.24 / 36.06 ms │        35.98 / 36.36 ±0.35 / 36.91 ms │     no change │
│ QQuery 21 │        16.95 / 17.15 ±0.13 / 17.30 ms │        17.07 / 17.18 ±0.12 / 17.35 ms │     no change │
│ QQuery 22 │        62.03 / 62.94 ±0.90 / 64.39 ms │        62.37 / 62.75 ±0.50 / 63.73 ms │     no change │
│ QQuery 23 │     340.69 / 344.85 ±3.41 / 349.87 ms │     343.40 / 346.64 ±2.72 / 350.29 ms │     no change │
│ QQuery 24 │     224.58 / 225.61 ±1.48 / 228.51 ms │     224.25 / 226.42 ±2.25 / 229.54 ms │     no change │
│ QQuery 25 │     113.57 / 115.02 ±0.99 / 116.64 ms │     113.86 / 116.90 ±2.35 / 120.68 ms │     no change │
│ QQuery 26 │        57.88 / 58.43 ±0.32 / 58.86 ms │        58.39 / 59.03 ±0.51 / 59.64 ms │     no change │
│ QQuery 27 │           6.57 / 6.72 ±0.17 / 7.04 ms │           6.70 / 6.80 ±0.15 / 7.10 ms │     no change │
│ QQuery 28 │        56.57 / 60.29 ±1.86 / 61.34 ms │        60.47 / 61.60 ±0.67 / 62.38 ms │     no change │
│ QQuery 29 │      98.79 / 101.09 ±1.99 / 104.71 ms │       97.92 / 99.46 ±1.69 / 102.26 ms │     no change │
│ QQuery 30 │        31.60 / 32.70 ±0.69 / 33.68 ms │        32.21 / 32.74 ±0.48 / 33.44 ms │     no change │
│ QQuery 31 │     111.52 / 113.02 ±2.19 / 117.37 ms │     111.18 / 112.74 ±1.09 / 114.37 ms │     no change │
│ QQuery 32 │        20.37 / 21.35 ±1.39 / 24.04 ms │        20.11 / 20.35 ±0.30 / 20.90 ms │     no change │
│ QQuery 33 │        38.36 / 39.26 ±0.95 / 40.99 ms │        38.44 / 38.68 ±0.20 / 39.03 ms │     no change │
│ QQuery 34 │          9.23 / 9.68 ±0.48 / 10.45 ms │          9.40 / 9.70 ±0.26 / 10.04 ms │     no change │
│ QQuery 35 │        76.36 / 77.29 ±0.64 / 78.22 ms │        76.03 / 76.53 ±0.38 / 77.11 ms │     no change │
│ QQuery 36 │           5.90 / 6.06 ±0.17 / 6.37 ms │           5.97 / 6.11 ±0.18 / 6.46 ms │     no change │
│ QQuery 37 │           6.71 / 6.85 ±0.15 / 7.10 ms │           6.80 / 6.87 ±0.05 / 6.94 ms │     no change │
│ QQuery 38 │        62.45 / 64.12 ±1.23 / 65.90 ms │        63.03 / 64.27 ±1.11 / 65.85 ms │     no change │
│ QQuery 39 │     450.63 / 455.74 ±3.98 / 460.52 ms │     456.69 / 463.58 ±5.33 / 471.56 ms │     no change │
│ QQuery 40 │        23.43 / 23.87 ±0.60 / 25.05 ms │        22.94 / 23.19 ±0.22 / 23.60 ms │     no change │
│ QQuery 41 │        11.18 / 11.38 ±0.20 / 11.76 ms │        11.45 / 11.57 ±0.13 / 11.80 ms │     no change │
│ QQuery 42 │        23.87 / 24.10 ±0.18 / 24.36 ms │        23.72 / 23.98 ±0.22 / 24.37 ms │     no change │
│ QQuery 43 │           4.79 / 4.89 ±0.15 / 5.18 ms │           4.84 / 4.94 ±0.10 / 5.14 ms │     no change │
│ QQuery 44 │        10.65 / 10.72 ±0.06 / 10.79 ms │        10.77 / 10.84 ±0.04 / 10.87 ms │     no change │
│ QQuery 45 │        38.09 / 39.80 ±2.85 / 45.48 ms │        38.30 / 40.94 ±2.77 / 44.96 ms │     no change │
│ QQuery 46 │        11.45 / 11.64 ±0.13 / 11.84 ms │        11.83 / 12.10 ±0.30 / 12.57 ms │     no change │
│ QQuery 47 │     228.88 / 231.25 ±2.09 / 234.84 ms │     230.39 / 235.31 ±5.12 / 244.83 ms │     no change │
│ QQuery 48 │        95.85 / 96.85 ±0.56 / 97.40 ms │        96.40 / 96.97 ±0.62 / 98.11 ms │     no change │
│ QQuery 49 │        76.89 / 79.24 ±2.19 / 82.70 ms │        77.34 / 79.29 ±2.49 / 84.10 ms │     no change │
│ QQuery 50 │        59.70 / 60.37 ±0.81 / 61.90 ms │        59.23 / 60.08 ±0.59 / 60.82 ms │     no change │
│ QQuery 51 │        90.55 / 92.71 ±1.86 / 95.95 ms │        91.11 / 93.30 ±1.54 / 95.47 ms │     no change │
│ QQuery 52 │        24.05 / 25.58 ±2.74 / 31.06 ms │        23.83 / 24.35 ±0.45 / 25.01 ms │     no change │
│ QQuery 53 │        29.34 / 32.19 ±4.82 / 41.81 ms │        30.21 / 30.65 ±0.31 / 31.11 ms │     no change │
│ QQuery 54 │        55.50 / 56.02 ±0.52 / 56.96 ms │        55.66 / 55.96 ±0.23 / 56.24 ms │     no change │
│ QQuery 55 │        23.34 / 23.66 ±0.30 / 24.20 ms │        23.55 / 23.79 ±0.25 / 24.27 ms │     no change │
│ QQuery 56 │        38.94 / 39.39 ±0.39 / 40.00 ms │        38.97 / 39.67 ±0.58 / 40.61 ms │     no change │
│ QQuery 57 │     176.48 / 178.94 ±3.62 / 186.12 ms │     177.47 / 179.15 ±2.10 / 183.24 ms │     no change │
│ QQuery 58 │     116.27 / 117.90 ±1.85 / 121.31 ms │     118.67 / 120.50 ±1.82 / 123.50 ms │     no change │
│ QQuery 59 │     115.87 / 116.84 ±0.49 / 117.21 ms │     117.92 / 118.72 ±0.48 / 119.29 ms │     no change │
│ QQuery 60 │        39.31 / 39.65 ±0.25 / 40.04 ms │        39.24 / 41.62 ±2.34 / 46.08 ms │     no change │
│ QQuery 61 │        12.76 / 13.07 ±0.18 / 13.28 ms │        13.27 / 13.45 ±0.19 / 13.80 ms │     no change │
│ QQuery 62 │        46.36 / 47.77 ±2.32 / 52.41 ms │        46.11 / 46.61 ±0.30 / 46.99 ms │     no change │
│ QQuery 63 │        30.02 / 30.78 ±0.47 / 31.26 ms │        29.88 / 30.06 ±0.13 / 30.28 ms │     no change │
│ QQuery 64 │     397.24 / 408.19 ±6.43 / 415.10 ms │     397.61 / 402.08 ±4.73 / 408.42 ms │     no change │
│ QQuery 65 │     143.52 / 148.69 ±3.03 / 152.96 ms │     143.99 / 149.24 ±2.95 / 152.12 ms │     no change │
│ QQuery 66 │        78.71 / 79.62 ±0.79 / 80.99 ms │        79.17 / 79.83 ±0.41 / 80.25 ms │     no change │
│ QQuery 67 │     249.66 / 255.71 ±4.77 / 262.11 ms │     243.87 / 251.95 ±5.88 / 258.56 ms │     no change │
│ QQuery 68 │        11.77 / 11.92 ±0.11 / 12.06 ms │        11.92 / 12.17 ±0.25 / 12.63 ms │     no change │
│ QQuery 69 │        63.30 / 66.88 ±3.53 / 72.74 ms │        62.63 / 63.63 ±1.52 / 66.65 ms │     no change │
│ QQuery 70 │     106.55 / 111.23 ±5.22 / 117.87 ms │     105.63 / 107.04 ±1.12 / 108.50 ms │     no change │
│ QQuery 71 │        35.54 / 35.85 ±0.26 / 36.28 ms │        36.04 / 38.98 ±5.25 / 49.48 ms │  1.09x slower │
│ QQuery 72 │ 2152.91 / 2208.05 ±52.28 / 2297.72 ms │ 2066.94 / 2196.25 ±86.55 / 2324.75 ms │     no change │
│ QQuery 73 │          9.45 / 9.86 ±0.54 / 10.88 ms │         9.34 / 10.51 ±1.81 / 14.09 ms │  1.07x slower │
│ QQuery 74 │     169.04 / 173.80 ±3.64 / 179.95 ms │     170.06 / 175.17 ±4.88 / 184.41 ms │     no change │
│ QQuery 75 │     147.29 / 152.50 ±5.35 / 159.78 ms │     147.84 / 149.48 ±0.96 / 150.56 ms │     no change │
│ QQuery 76 │        35.34 / 35.68 ±0.21 / 35.92 ms │        35.19 / 35.38 ±0.20 / 35.72 ms │     no change │
│ QQuery 77 │        60.99 / 63.97 ±3.35 / 70.46 ms │        61.30 / 65.76 ±3.82 / 72.05 ms │     no change │
│ QQuery 78 │     189.39 / 195.43 ±5.19 / 204.20 ms │     189.88 / 195.24 ±3.99 / 200.00 ms │     no change │
│ QQuery 79 │        67.16 / 67.60 ±0.23 / 67.82 ms │        67.39 / 67.96 ±0.57 / 68.89 ms │     no change │
│ QQuery 80 │     101.70 / 103.58 ±1.34 / 105.32 ms │     101.16 / 103.16 ±2.23 / 107.45 ms │     no change │
│ QQuery 81 │        25.56 / 29.77 ±4.50 / 36.08 ms │        25.93 / 28.22 ±3.90 / 36.00 ms │ +1.05x faster │
│ QQuery 82 │        16.37 / 16.79 ±0.24 / 17.00 ms │        16.63 / 16.98 ±0.43 / 17.83 ms │     no change │
│ QQuery 83 │        40.21 / 40.47 ±0.27 / 40.86 ms │        40.14 / 40.70 ±0.60 / 41.81 ms │     no change │
│ QQuery 84 │        34.51 / 34.69 ±0.11 / 34.81 ms │        34.16 / 34.66 ±0.26 / 34.83 ms │     no change │
│ QQuery 85 │     108.64 / 112.89 ±3.26 / 116.77 ms │     108.37 / 112.43 ±3.77 / 118.61 ms │     no change │
│ QQuery 86 │        24.95 / 25.48 ±0.56 / 26.56 ms │        25.18 / 25.69 ±0.38 / 26.16 ms │     no change │
│ QQuery 87 │        64.70 / 65.38 ±0.82 / 66.88 ms │        63.72 / 64.99 ±0.97 / 66.42 ms │     no change │
│ QQuery 88 │        62.60 / 64.61 ±2.38 / 69.28 ms │        61.72 / 62.37 ±0.60 / 63.25 ms │     no change │
│ QQuery 89 │        36.67 / 38.82 ±2.44 / 42.21 ms │        36.56 / 40.19 ±3.70 / 44.99 ms │     no change │
│ QQuery 90 │        17.12 / 17.27 ±0.22 / 17.71 ms │        16.97 / 17.17 ±0.22 / 17.52 ms │     no change │
│ QQuery 91 │        45.14 / 45.65 ±0.59 / 46.74 ms │        45.11 / 45.92 ±0.43 / 46.25 ms │     no change │
│ QQuery 92 │        30.03 / 30.45 ±0.39 / 31.10 ms │        30.15 / 30.41 ±0.24 / 30.84 ms │     no change │
│ QQuery 93 │        50.87 / 52.28 ±1.02 / 53.43 ms │        50.97 / 51.93 ±0.91 / 53.06 ms │     no change │
│ QQuery 94 │        39.32 / 40.51 ±1.57 / 43.55 ms │        38.45 / 41.05 ±2.78 / 44.75 ms │     no change │
│ QQuery 95 │        85.11 / 87.23 ±1.64 / 89.91 ms │        85.66 / 86.80 ±1.24 / 89.09 ms │     no change │
│ QQuery 96 │        24.12 / 24.43 ±0.32 / 25.03 ms │        24.47 / 24.88 ±0.32 / 25.27 ms │     no change │
│ QQuery 97 │        46.08 / 46.99 ±1.44 / 49.85 ms │        46.31 / 46.64 ±0.23 / 46.87 ms │     no change │
│ QQuery 98 │        42.92 / 44.38 ±0.99 / 45.67 ms │        43.75 / 44.47 ±0.55 / 45.23 ms │     no change │
│ QQuery 99 │        69.69 / 70.28 ±0.39 / 70.82 ms │        70.12 / 70.56 ±0.35 / 70.98 ms │     no change │
└───────────┴───────────────────────────────────────┴───────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 10508.09ms │
│ Total Time (lift-selectivity-stats)   │ 10507.70ms │
│ Average Time (HEAD)                   │   106.14ms │
│ Average Time (lift-selectivity-stats) │   106.14ms │
│ Queries Faster                        │          1 │
│ Queries Slower                        │          2 │
│ Queries with No Change                │         96 │
│ Queries with Failure                  │          0 │
└───────────────────────────────────────┴────────────┘

Resource Usage

tpcds — base (merge-base)

Metric Value
Wall time 55.0s
Peak memory 7.0 GiB
Avg memory 6.2 GiB
CPU user 239.3s
CPU sys 6.9s
Peak spill 0 B

tpcds — branch

Metric Value
Wall time 55.0s
Peak memory 7.1 GiB
Avg memory 6.3 GiB
CPU user 237.8s
CPU sys 6.4s
Peak spill 0 B

File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark clickbench_partitioned.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                                  HEAD ┃                lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 0  │          1.18 / 3.37 ±4.28 / 11.93 ms │          1.17 / 3.38 ±4.30 / 11.99 ms │     no change │
│ QQuery 1  │        12.25 / 12.75 ±0.34 / 13.16 ms │        12.30 / 12.63 ±0.19 / 12.85 ms │     no change │
│ QQuery 2  │        35.68 / 36.24 ±0.65 / 37.51 ms │        35.39 / 35.89 ±0.35 / 36.49 ms │     no change │
│ QQuery 3  │        31.44 / 32.49 ±0.95 / 34.11 ms │        30.95 / 31.15 ±0.15 / 31.35 ms │     no change │
│ QQuery 4  │     232.58 / 234.40 ±2.57 / 239.43 ms │     229.42 / 234.55 ±3.29 / 239.12 ms │     no change │
│ QQuery 5  │     277.10 / 279.65 ±2.11 / 282.96 ms │     280.52 / 282.27 ±1.48 / 284.81 ms │     no change │
│ QQuery 6  │           1.19 / 1.33 ±0.22 / 1.78 ms │           1.19 / 1.34 ±0.24 / 1.83 ms │     no change │
│ QQuery 7  │        13.60 / 13.71 ±0.07 / 13.80 ms │        13.65 / 13.82 ±0.11 / 13.97 ms │     no change │
│ QQuery 8  │     325.45 / 330.75 ±3.56 / 334.04 ms │     332.21 / 333.67 ±1.04 / 335.12 ms │     no change │
│ QQuery 9  │     466.36 / 472.90 ±5.65 / 481.40 ms │     469.80 / 483.74 ±9.77 / 498.56 ms │     no change │
│ QQuery 10 │        70.38 / 71.34 ±0.65 / 72.31 ms │        70.92 / 77.00 ±8.62 / 94.01 ms │  1.08x slower │
│ QQuery 11 │        81.83 / 84.98 ±5.16 / 95.27 ms │        82.03 / 83.31 ±1.05 / 85.01 ms │     no change │
│ QQuery 12 │     271.56 / 276.31 ±4.13 / 281.35 ms │     274.63 / 282.71 ±5.37 / 289.85 ms │     no change │
│ QQuery 13 │     378.59 / 391.29 ±9.15 / 405.80 ms │     380.03 / 385.98 ±7.89 / 400.66 ms │     no change │
│ QQuery 14 │    286.66 / 298.31 ±10.58 / 315.00 ms │    291.19 / 305.57 ±15.54 / 334.61 ms │     no change │
│ QQuery 15 │     274.09 / 280.46 ±4.13 / 285.50 ms │    279.63 / 297.92 ±22.65 / 341.50 ms │  1.06x slower │
│ QQuery 16 │     626.69 / 636.47 ±7.96 / 648.93 ms │    624.05 / 642.15 ±11.84 / 660.82 ms │     no change │
│ QQuery 17 │     629.32 / 644.33 ±9.03 / 652.67 ms │    636.04 / 648.91 ±10.53 / 666.05 ms │     no change │
│ QQuery 18 │ 1275.69 / 1300.83 ±17.59 / 1319.77 ms │ 1290.19 / 1311.60 ±16.32 / 1334.63 ms │     no change │
│ QQuery 19 │        27.74 / 29.70 ±3.07 / 35.80 ms │        27.98 / 31.11 ±5.57 / 42.22 ms │     no change │
│ QQuery 20 │    522.94 / 542.46 ±16.87 / 569.25 ms │     522.83 / 532.03 ±7.87 / 542.73 ms │     no change │
│ QQuery 21 │     595.89 / 604.86 ±6.29 / 614.16 ms │     596.79 / 605.63 ±9.20 / 621.60 ms │     no change │
│ QQuery 22 │  1078.45 / 1088.12 ±6.48 / 1096.06 ms │ 1073.11 / 1084.34 ±10.73 / 1104.12 ms │     no change │
│ QQuery 23 │ 3251.99 / 3297.64 ±32.24 / 3342.42 ms │ 3261.45 / 3292.63 ±19.12 / 3319.73 ms │     no change │
│ QQuery 24 │        41.85 / 43.84 ±3.53 / 50.89 ms │        42.57 / 44.58 ±1.93 / 47.09 ms │     no change │
│ QQuery 25 │     112.07 / 114.18 ±1.66 / 116.02 ms │    112.19 / 122.12 ±14.15 / 149.26 ms │  1.07x slower │
│ QQuery 26 │        41.98 / 44.79 ±2.47 / 47.86 ms │        42.45 / 44.64 ±2.90 / 50.26 ms │     no change │
│ QQuery 27 │     669.13 / 681.49 ±8.32 / 694.31 ms │     681.67 / 685.68 ±2.37 / 688.32 ms │     no change │
│ QQuery 28 │ 3068.44 / 3102.79 ±18.08 / 3117.97 ms │ 3063.72 / 3084.14 ±28.99 / 3141.29 ms │     no change │
│ QQuery 29 │        40.45 / 45.87 ±5.97 / 53.88 ms │       40.54 / 46.27 ±10.61 / 67.46 ms │     no change │
│ QQuery 30 │     305.61 / 313.89 ±4.38 / 317.61 ms │    305.09 / 321.45 ±10.30 / 333.93 ms │     no change │
│ QQuery 31 │     291.34 / 298.47 ±6.87 / 309.26 ms │     300.32 / 307.90 ±6.71 / 316.02 ms │     no change │
│ QQuery 32 │   958.58 / 997.66 ±26.08 / 1035.11 ms │  999.36 / 1042.47 ±29.48 / 1086.19 ms │     no change │
│ QQuery 33 │ 1503.09 / 1535.50 ±28.03 / 1569.85 ms │ 1502.73 / 1532.15 ±27.30 / 1579.36 ms │     no change │
│ QQuery 34 │ 1535.22 / 1565.03 ±26.21 / 1609.06 ms │ 1509.49 / 1556.33 ±30.97 / 1606.83 ms │     no change │
│ QQuery 35 │    293.25 / 319.09 ±31.85 / 371.80 ms │    288.65 / 302.89 ±14.68 / 328.68 ms │ +1.05x faster │
│ QQuery 36 │        68.12 / 75.88 ±7.47 / 85.25 ms │       66.25 / 77.04 ±10.43 / 96.38 ms │     no change │
│ QQuery 37 │        36.14 / 40.14 ±4.70 / 47.04 ms │        35.82 / 38.61 ±3.93 / 46.32 ms │     no change │
│ QQuery 38 │        40.78 / 45.97 ±6.39 / 58.57 ms │        42.98 / 45.67 ±3.80 / 52.96 ms │     no change │
│ QQuery 39 │    142.78 / 159.37 ±14.50 / 185.45 ms │     155.58 / 160.28 ±4.45 / 167.13 ms │     no change │
│ QQuery 40 │        13.94 / 16.25 ±3.90 / 24.04 ms │        14.02 / 14.52 ±0.36 / 14.98 ms │ +1.12x faster │
│ QQuery 41 │        13.91 / 14.48 ±0.55 / 15.45 ms │        13.62 / 15.62 ±3.61 / 22.82 ms │  1.08x slower │
│ QQuery 42 │        13.17 / 15.15 ±3.38 / 21.90 ms │        13.28 / 13.47 ±0.19 / 13.82 ms │ +1.12x faster │
└───────────┴───────────────────────────────────────┴───────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 20394.52ms │
│ Total Time (lift-selectivity-stats)   │ 20469.15ms │
│ Average Time (HEAD)                   │   474.29ms │
│ Average Time (lift-selectivity-stats) │   476.03ms │
│ Queries Faster                        │          3 │
│ Queries Slower                        │          4 │
│ Queries with No Change                │         36 │
│ Queries with Failure                  │          0 │
└───────────────────────────────────────┴────────────┘

Resource Usage

clickbench_partitioned — base (merge-base)

Metric Value
Wall time 105.0s
Peak memory 30.7 GiB
Avg memory 23.3 GiB
CPU user 1042.7s
CPU sys 80.5s
Peak spill 0 B

clickbench_partitioned — branch

Metric Value
Wall time 105.0s
Peak memory 30.5 GiB
Avg memory 22.9 GiB
CPU user 1044.9s
CPU sys 81.6s
Peak spill 0 B

File an issue against this benchmark runner

@adriangb

adriangb commented Jun 1, 2026

Copy link
Copy Markdown
Contributor Author

run benchmarks

env:
  DATAFUSION_EXECUTION_ADAPTIVE_FILTER_REORDERING: true

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4593483789-393-l7kzn 6.12.68+ #1 SMP Wed Apr 1 02:23:28 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (5e71ea4) to 85bc5ef (merge-base) diff using: tpcds
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4593483789-394-mr9rk 6.12.68+ #1 SMP Wed Apr 1 02:23:28 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (5e71ea4) to 85bc5ef (merge-base) diff using: tpch
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4593483789-392-9q7fl 6.12.68+ #1 SMP Wed Apr 1 02:23:28 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (5e71ea4) to 85bc5ef (merge-base) diff using: clickbench_partitioned
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark tpch_sf1.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                           HEAD ┃         lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1  │ 37.96 / 39.80 ±2.12 / 43.53 ms │ 38.10 / 38.80 ±1.03 / 40.85 ms │     no change │
│ QQuery 2  │ 18.55 / 20.75 ±3.43 / 27.49 ms │ 18.08 / 18.65 ±0.46 / 19.36 ms │ +1.11x faster │
│ QQuery 3  │ 30.88 / 32.37 ±1.26 / 34.09 ms │ 30.43 / 33.54 ±1.75 / 35.84 ms │     no change │
│ QQuery 4  │ 16.88 / 17.47 ±0.67 / 18.77 ms │ 17.13 / 17.26 ±0.11 / 17.46 ms │     no change │
│ QQuery 5  │ 37.22 / 39.35 ±1.43 / 40.64 ms │ 37.75 / 39.31 ±1.31 / 40.66 ms │     no change │
│ QQuery 6  │ 15.90 / 16.09 ±0.21 / 16.46 ms │ 15.84 / 16.12 ±0.29 / 16.61 ms │     no change │
│ QQuery 7  │ 42.77 / 44.59 ±1.18 / 46.04 ms │ 44.74 / 46.06 ±1.02 / 47.08 ms │     no change │
│ QQuery 8  │ 42.30 / 42.69 ±0.26 / 43.01 ms │ 42.36 / 42.66 ±0.17 / 42.88 ms │     no change │
│ QQuery 9  │ 48.06 / 49.59 ±0.84 / 50.60 ms │ 48.35 / 49.28 ±0.70 / 50.50 ms │     no change │
│ QQuery 10 │ 41.60 / 42.05 ±0.60 / 43.25 ms │ 41.76 / 42.26 ±0.47 / 43.16 ms │     no change │
│ QQuery 11 │ 12.89 / 13.10 ±0.11 / 13.20 ms │ 12.88 / 13.05 ±0.15 / 13.31 ms │     no change │
│ QQuery 12 │ 23.80 / 24.70 ±0.77 / 25.87 ms │ 20.58 / 20.86 ±0.18 / 21.11 ms │ +1.18x faster │
│ QQuery 13 │ 31.71 / 33.06 ±1.31 / 35.55 ms │ 32.23 / 33.56 ±1.16 / 35.41 ms │     no change │
│ QQuery 14 │ 23.35 / 23.61 ±0.17 / 23.87 ms │ 23.44 / 23.66 ±0.15 / 23.84 ms │     no change │
│ QQuery 15 │ 30.82 / 31.50 ±0.88 / 33.21 ms │ 31.67 / 32.31 ±0.80 / 33.86 ms │     no change │
│ QQuery 16 │ 13.76 / 14.07 ±0.18 / 14.30 ms │ 14.24 / 14.38 ±0.10 / 14.54 ms │     no change │
│ QQuery 17 │ 72.86 / 73.25 ±0.22 / 73.56 ms │ 71.84 / 73.39 ±1.16 / 75.41 ms │     no change │
│ QQuery 18 │ 57.52 / 59.20 ±1.18 / 60.76 ms │ 57.76 / 59.08 ±1.02 / 60.73 ms │     no change │
│ QQuery 19 │ 32.60 / 33.21 ±0.77 / 34.73 ms │ 32.53 / 33.36 ±0.78 / 34.71 ms │     no change │
│ QQuery 20 │ 31.65 / 31.74 ±0.05 / 31.79 ms │ 31.87 / 32.07 ±0.19 / 32.34 ms │     no change │
│ QQuery 21 │ 55.22 / 56.42 ±1.08 / 58.30 ms │ 54.62 / 56.95 ±1.67 / 58.98 ms │     no change │
│ QQuery 22 │ 13.94 / 14.33 ±0.49 / 15.17 ms │ 13.72 / 13.79 ±0.06 / 13.89 ms │     no change │
└───────────┴────────────────────────────────┴────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━┓
┃ Benchmark Summary                     ┃          ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 752.95ms │
│ Total Time (lift-selectivity-stats)   │ 750.40ms │
│ Average Time (HEAD)                   │  34.22ms │
│ Average Time (lift-selectivity-stats) │  34.11ms │
│ Queries Faster                        │        2 │
│ Queries Slower                        │        0 │
│ Queries with No Change                │       20 │
│ Queries with Failure                  │        0 │
└───────────────────────────────────────┴──────────┘

Resource Usage

tpch — base (merge-base)

Metric Value
Wall time 5.0s
Peak memory 5.7 GiB
Avg memory 4.9 GiB
CPU user 29.4s
CPU sys 2.2s
Peak spill 0 B

tpch — branch

Metric Value
Wall time 5.0s
Peak memory 5.7 GiB
Avg memory 4.9 GiB
CPU user 29.5s
CPU sys 2.1s
Peak spill 0 B

File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark tpcds_sf1.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                                  HEAD ┃                lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1  │           5.87 / 6.39 ±0.91 / 8.20 ms │           5.81 / 6.29 ±0.85 / 7.98 ms │     no change │
│ QQuery 2  │        80.40 / 80.49 ±0.14 / 80.77 ms │        80.75 / 80.99 ±0.21 / 81.29 ms │     no change │
│ QQuery 3  │        28.90 / 29.42 ±0.27 / 29.67 ms │        29.27 / 29.51 ±0.23 / 29.93 ms │     no change │
│ QQuery 4  │     497.61 / 504.12 ±5.31 / 513.47 ms │     502.60 / 510.32 ±6.55 / 518.41 ms │     no change │
│ QQuery 5  │        51.86 / 52.26 ±0.48 / 53.16 ms │        53.08 / 53.48 ±0.48 / 54.42 ms │     no change │
│ QQuery 6  │        36.63 / 37.03 ±0.27 / 37.44 ms │        36.43 / 37.16 ±0.40 / 37.49 ms │     no change │
│ QQuery 7  │       95.79 / 97.23 ±1.93 / 100.85 ms │       97.34 / 99.85 ±3.46 / 106.65 ms │     no change │
│ QQuery 8  │        37.66 / 38.25 ±0.43 / 38.91 ms │        37.69 / 37.94 ±0.16 / 38.15 ms │     no change │
│ QQuery 9  │        52.98 / 55.00 ±1.24 / 56.50 ms │        54.86 / 55.82 ±0.87 / 57.26 ms │     no change │
│ QQuery 10 │        68.86 / 69.26 ±0.29 / 69.78 ms │        69.27 / 69.77 ±0.29 / 70.12 ms │     no change │
│ QQuery 11 │     300.92 / 310.94 ±6.18 / 319.37 ms │     313.07 / 317.61 ±2.62 / 320.30 ms │     no change │
│ QQuery 12 │        28.84 / 29.28 ±0.29 / 29.74 ms │        29.19 / 29.54 ±0.25 / 29.95 ms │     no change │
│ QQuery 13 │     119.21 / 120.87 ±1.80 / 123.82 ms │     121.76 / 123.99 ±3.37 / 130.67 ms │     no change │
│ QQuery 14 │     505.56 / 508.47 ±2.72 / 512.47 ms │     513.36 / 517.06 ±2.91 / 521.43 ms │     no change │
│ QQuery 15 │        59.55 / 60.52 ±0.67 / 61.58 ms │        61.32 / 63.83 ±3.07 / 68.87 ms │  1.05x slower │
│ QQuery 16 │           6.83 / 7.02 ±0.24 / 7.45 ms │           6.88 / 7.04 ±0.16 / 7.34 ms │     no change │
│ QQuery 17 │        82.21 / 84.80 ±4.22 / 93.21 ms │        83.09 / 83.78 ±0.86 / 85.44 ms │     no change │
│ QQuery 18 │     126.52 / 127.70 ±1.18 / 129.44 ms │     128.02 / 129.38 ±1.46 / 132.09 ms │     no change │
│ QQuery 19 │        42.33 / 42.46 ±0.08 / 42.58 ms │        43.01 / 43.25 ±0.30 / 43.84 ms │     no change │
│ QQuery 20 │        36.27 / 36.77 ±0.28 / 37.06 ms │        35.76 / 36.42 ±0.39 / 36.94 ms │     no change │
│ QQuery 21 │        17.21 / 17.42 ±0.15 / 17.61 ms │        17.11 / 17.29 ±0.16 / 17.53 ms │     no change │
│ QQuery 22 │        63.20 / 63.59 ±0.28 / 64.00 ms │        62.45 / 62.87 ±0.22 / 63.06 ms │     no change │
│ QQuery 23 │     346.34 / 351.02 ±4.76 / 357.49 ms │     343.95 / 350.73 ±4.87 / 358.19 ms │     no change │
│ QQuery 24 │     225.98 / 228.74 ±4.17 / 236.96 ms │     225.35 / 230.30 ±4.32 / 235.86 ms │     no change │
│ QQuery 25 │     114.63 / 117.57 ±2.41 / 120.86 ms │     114.18 / 115.52 ±1.45 / 118.21 ms │     no change │
│ QQuery 26 │        58.62 / 58.98 ±0.35 / 59.53 ms │        59.17 / 59.56 ±0.36 / 60.08 ms │     no change │
│ QQuery 27 │           6.63 / 6.79 ±0.15 / 7.08 ms │           6.54 / 6.69 ±0.16 / 6.99 ms │     no change │
│ QQuery 28 │        56.40 / 60.62 ±2.12 / 62.02 ms │        56.83 / 61.81 ±5.10 / 71.05 ms │     no change │
│ QQuery 29 │      97.97 / 101.08 ±3.16 / 107.13 ms │       97.94 / 99.09 ±0.91 / 100.43 ms │     no change │
│ QQuery 30 │        32.49 / 32.89 ±0.29 / 33.31 ms │        32.79 / 33.17 ±0.30 / 33.62 ms │     no change │
│ QQuery 31 │     112.37 / 114.23 ±2.24 / 118.32 ms │     113.08 / 115.06 ±2.13 / 119.01 ms │     no change │
│ QQuery 32 │        20.34 / 20.81 ±0.34 / 21.34 ms │        20.32 / 20.59 ±0.23 / 21.01 ms │     no change │
│ QQuery 33 │        39.08 / 39.25 ±0.17 / 39.58 ms │        38.38 / 38.82 ±0.41 / 39.49 ms │     no change │
│ QQuery 34 │          9.47 / 9.97 ±0.38 / 10.37 ms │           9.54 / 9.70 ±0.15 / 9.93 ms │     no change │
│ QQuery 35 │        77.70 / 78.68 ±0.94 / 79.98 ms │        77.14 / 79.59 ±2.51 / 84.00 ms │     no change │
│ QQuery 36 │           5.97 / 6.11 ±0.17 / 6.42 ms │           5.95 / 6.06 ±0.17 / 6.40 ms │     no change │
│ QQuery 37 │          7.04 / 8.13 ±1.99 / 12.11 ms │           6.83 / 6.99 ±0.11 / 7.18 ms │ +1.16x faster │
│ QQuery 38 │        64.14 / 64.86 ±1.02 / 66.83 ms │        62.56 / 64.16 ±1.00 / 65.57 ms │     no change │
│ QQuery 39 │     454.99 / 461.63 ±5.33 / 468.22 ms │     456.56 / 460.55 ±2.93 / 463.53 ms │     no change │
│ QQuery 40 │        23.34 / 23.61 ±0.38 / 24.35 ms │        23.33 / 23.48 ±0.09 / 23.60 ms │     no change │
│ QQuery 41 │        11.38 / 11.61 ±0.27 / 12.13 ms │        11.43 / 11.60 ±0.19 / 11.95 ms │     no change │
│ QQuery 42 │        24.04 / 24.22 ±0.12 / 24.38 ms │        23.93 / 24.33 ±0.34 / 24.91 ms │     no change │
│ QQuery 43 │          4.82 / 6.60 ±3.20 / 12.99 ms │           4.88 / 5.00 ±0.13 / 5.22 ms │ +1.32x faster │
│ QQuery 44 │        10.95 / 12.42 ±2.40 / 17.21 ms │        10.86 / 11.04 ±0.15 / 11.23 ms │ +1.12x faster │
│ QQuery 45 │        39.23 / 40.18 ±0.72 / 41.18 ms │        39.16 / 39.72 ±0.43 / 40.30 ms │     no change │
│ QQuery 46 │        11.88 / 12.31 ±0.66 / 13.63 ms │        11.66 / 11.98 ±0.23 / 12.37 ms │     no change │
│ QQuery 47 │     233.58 / 238.95 ±4.87 / 247.21 ms │     232.33 / 235.60 ±2.69 / 238.66 ms │     no change │
│ QQuery 48 │        96.95 / 97.23 ±0.20 / 97.51 ms │      98.22 / 100.30 ±3.20 / 106.68 ms │     no change │
│ QQuery 49 │        78.34 / 79.35 ±1.82 / 82.98 ms │        78.23 / 79.22 ±1.00 / 80.52 ms │     no change │
│ QQuery 50 │        59.63 / 60.05 ±0.31 / 60.45 ms │        59.90 / 60.31 ±0.22 / 60.51 ms │     no change │
│ QQuery 51 │       95.27 / 97.53 ±1.83 / 100.36 ms │        93.71 / 95.97 ±1.64 / 98.38 ms │     no change │
│ QQuery 52 │        24.45 / 24.79 ±0.24 / 25.04 ms │        24.85 / 25.13 ±0.34 / 25.73 ms │     no change │
│ QQuery 53 │        29.90 / 30.02 ±0.12 / 30.20 ms │        30.02 / 30.39 ±0.29 / 30.92 ms │     no change │
│ QQuery 54 │        55.97 / 56.33 ±0.35 / 56.98 ms │        55.59 / 56.24 ±0.55 / 56.90 ms │     no change │
│ QQuery 55 │        23.80 / 24.08 ±0.18 / 24.30 ms │        23.79 / 23.90 ±0.09 / 24.04 ms │     no change │
│ QQuery 56 │        39.22 / 43.08 ±7.12 / 57.30 ms │        38.95 / 40.24 ±1.97 / 44.15 ms │ +1.07x faster │
│ QQuery 57 │     175.50 / 177.47 ±1.25 / 179.15 ms │     176.75 / 179.15 ±2.80 / 183.77 ms │     no change │
│ QQuery 58 │     116.37 / 119.38 ±3.49 / 125.78 ms │     119.10 / 119.99 ±1.43 / 122.83 ms │     no change │
│ QQuery 59 │     117.57 / 119.26 ±2.87 / 124.98 ms │     118.04 / 120.81 ±3.32 / 127.20 ms │     no change │
│ QQuery 60 │        39.51 / 40.41 ±0.61 / 41.39 ms │        39.77 / 40.41 ±0.55 / 41.31 ms │     no change │
│ QQuery 61 │        13.05 / 13.27 ±0.22 / 13.62 ms │        13.14 / 13.21 ±0.08 / 13.34 ms │     no change │
│ QQuery 62 │        45.57 / 46.10 ±0.30 / 46.41 ms │        47.03 / 47.23 ±0.15 / 47.46 ms │     no change │
│ QQuery 63 │        29.78 / 30.23 ±0.36 / 30.87 ms │        30.16 / 30.40 ±0.17 / 30.56 ms │     no change │
│ QQuery 64 │    395.95 / 409.41 ±14.28 / 428.22 ms │     399.31 / 406.61 ±4.95 / 412.75 ms │     no change │
│ QQuery 65 │     145.01 / 151.75 ±5.59 / 159.22 ms │     145.19 / 151.83 ±3.59 / 155.38 ms │     no change │
│ QQuery 66 │        79.38 / 80.48 ±0.63 / 81.12 ms │        80.09 / 83.26 ±5.35 / 93.94 ms │     no change │
│ QQuery 67 │     251.95 / 258.05 ±4.50 / 265.60 ms │     253.13 / 258.35 ±5.51 / 267.75 ms │     no change │
│ QQuery 68 │        12.13 / 12.39 ±0.24 / 12.80 ms │        11.85 / 12.06 ±0.20 / 12.35 ms │     no change │
│ QQuery 69 │        62.88 / 63.56 ±0.67 / 64.77 ms │        63.16 / 63.60 ±0.33 / 64.15 ms │     no change │
│ QQuery 70 │    105.01 / 115.49 ±11.01 / 131.54 ms │     105.38 / 107.05 ±1.09 / 108.72 ms │ +1.08x faster │
│ QQuery 71 │        35.74 / 36.73 ±0.58 / 37.43 ms │        35.82 / 39.62 ±5.98 / 51.54 ms │  1.08x slower │
│ QQuery 72 │ 2100.01 / 2170.55 ±68.28 / 2259.28 ms │ 2153.09 / 2246.72 ±56.21 / 2326.64 ms │     no change │
│ QQuery 73 │           9.34 / 9.55 ±0.23 / 9.98 ms │         9.52 / 10.00 ±0.43 / 10.69 ms │     no change │
│ QQuery 74 │     172.82 / 176.91 ±3.37 / 182.62 ms │     177.00 / 180.93 ±6.55 / 193.98 ms │     no change │
│ QQuery 75 │     148.07 / 149.71 ±1.60 / 152.78 ms │     149.61 / 152.02 ±3.46 / 158.84 ms │     no change │
│ QQuery 76 │        35.49 / 35.81 ±0.17 / 35.95 ms │        35.78 / 36.11 ±0.27 / 36.51 ms │     no change │
│ QQuery 77 │        62.70 / 66.15 ±5.51 / 77.13 ms │        61.85 / 62.48 ±0.46 / 63.21 ms │ +1.06x faster │
│ QQuery 78 │     188.40 / 197.01 ±6.63 / 208.76 ms │     192.08 / 195.91 ±4.20 / 201.38 ms │     no change │
│ QQuery 79 │        67.20 / 67.66 ±0.25 / 67.90 ms │        67.73 / 69.63 ±3.17 / 75.93 ms │     no change │
│ QQuery 80 │     101.00 / 105.12 ±4.66 / 114.22 ms │     102.35 / 105.30 ±3.06 / 110.84 ms │     no change │
│ QQuery 81 │        25.82 / 26.32 ±0.42 / 27.09 ms │        25.81 / 26.05 ±0.17 / 26.24 ms │     no change │
│ QQuery 82 │        16.48 / 16.94 ±0.52 / 17.93 ms │        16.58 / 16.95 ±0.37 / 17.63 ms │     no change │
│ QQuery 83 │        39.93 / 40.37 ±0.42 / 40.97 ms │        40.33 / 43.03 ±3.63 / 50.18 ms │  1.07x slower │
│ QQuery 84 │        34.87 / 35.13 ±0.26 / 35.59 ms │        34.88 / 36.84 ±3.04 / 42.90 ms │     no change │
│ QQuery 85 │     109.72 / 114.76 ±5.78 / 125.66 ms │     109.54 / 110.92 ±1.09 / 112.65 ms │     no change │
│ QQuery 86 │        25.16 / 25.70 ±0.37 / 26.32 ms │        25.21 / 25.49 ±0.32 / 26.01 ms │     no change │
│ QQuery 87 │        64.77 / 65.42 ±0.37 / 65.85 ms │        64.13 / 66.74 ±2.80 / 71.92 ms │     no change │
│ QQuery 88 │        62.98 / 65.72 ±3.72 / 73.07 ms │        62.04 / 62.92 ±0.67 / 64.11 ms │     no change │
│ QQuery 89 │        36.27 / 37.03 ±0.52 / 37.76 ms │        36.03 / 36.65 ±0.37 / 37.17 ms │     no change │
│ QQuery 90 │        16.91 / 17.03 ±0.09 / 17.15 ms │        16.87 / 17.08 ±0.16 / 17.28 ms │     no change │
│ QQuery 91 │        44.58 / 45.77 ±0.63 / 46.43 ms │        45.19 / 46.70 ±2.23 / 51.12 ms │     no change │
│ QQuery 92 │        30.45 / 30.85 ±0.49 / 31.80 ms │        30.29 / 32.42 ±3.15 / 38.62 ms │  1.05x slower │
│ QQuery 93 │        51.25 / 52.29 ±1.24 / 54.66 ms │        51.71 / 52.79 ±0.82 / 54.23 ms │     no change │
│ QQuery 94 │        40.04 / 41.78 ±2.46 / 46.64 ms │        39.18 / 39.53 ±0.41 / 40.20 ms │ +1.06x faster │
│ QQuery 95 │        85.23 / 86.75 ±0.99 / 88.21 ms │        84.24 / 86.77 ±2.22 / 90.55 ms │     no change │
│ QQuery 96 │        24.61 / 24.81 ±0.18 / 25.02 ms │        24.37 / 25.25 ±1.30 / 27.80 ms │     no change │
│ QQuery 97 │        46.75 / 47.98 ±1.43 / 50.68 ms │        46.57 / 47.27 ±0.40 / 47.68 ms │     no change │
│ QQuery 98 │        44.15 / 45.97 ±2.70 / 51.30 ms │        43.22 / 43.51 ±0.45 / 44.40 ms │ +1.06x faster │
│ QQuery 99 │        69.47 / 70.02 ±0.39 / 70.61 ms │        70.80 / 71.13 ±0.30 / 71.69 ms │     no change │
└───────────┴───────────────────────────────────────┴───────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 10566.11ms │
│ Total Time (lift-selectivity-stats)   │ 10666.78ms │
│ Average Time (HEAD)                   │   106.73ms │
│ Average Time (lift-selectivity-stats) │   107.75ms │
│ Queries Faster                        │          8 │
│ Queries Slower                        │          4 │
│ Queries with No Change                │         87 │
│ Queries with Failure                  │          0 │
└───────────────────────────────────────┴────────────┘

Resource Usage

tpcds — base (merge-base)

Metric Value
Wall time 55.0s
Peak memory 6.8 GiB
Avg memory 6.2 GiB
CPU user 240.4s
CPU sys 6.8s
Peak spill 0 B

tpcds — branch

Metric Value
Wall time 55.0s
Peak memory 6.8 GiB
Avg memory 6.3 GiB
CPU user 240.3s
CPU sys 6.7s
Peak spill 0 B

File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark clickbench_partitioned.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                                  HEAD ┃                lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 0  │          1.18 / 3.38 ±4.29 / 11.96 ms │          1.16 / 3.36 ±4.30 / 11.96 ms │     no change │
│ QQuery 1  │        12.46 / 12.64 ±0.12 / 12.79 ms │        12.11 / 12.44 ±0.21 / 12.68 ms │     no change │
│ QQuery 2  │        35.82 / 36.31 ±0.30 / 36.62 ms │        35.83 / 36.23 ±0.32 / 36.73 ms │     no change │
│ QQuery 3  │        30.74 / 31.31 ±0.56 / 32.37 ms │        30.63 / 31.42 ±0.76 / 32.67 ms │     no change │
│ QQuery 4  │     222.56 / 227.15 ±2.94 / 230.34 ms │     227.05 / 231.04 ±2.39 / 234.37 ms │     no change │
│ QQuery 5  │     273.77 / 277.27 ±2.74 / 280.77 ms │     274.01 / 279.53 ±4.36 / 287.26 ms │     no change │
│ QQuery 6  │           1.19 / 1.34 ±0.22 / 1.78 ms │           1.16 / 1.32 ±0.23 / 1.76 ms │     no change │
│ QQuery 7  │        13.66 / 13.87 ±0.15 / 14.09 ms │        13.48 / 13.61 ±0.10 / 13.74 ms │     no change │
│ QQuery 8  │     324.12 / 331.26 ±5.46 / 337.88 ms │     325.80 / 331.23 ±4.56 / 337.43 ms │     no change │
│ QQuery 9  │     457.86 / 469.37 ±8.98 / 479.74 ms │     458.88 / 473.14 ±9.98 / 488.38 ms │     no change │
│ QQuery 10 │        69.88 / 70.84 ±0.81 / 72.02 ms │        69.86 / 70.42 ±0.73 / 71.85 ms │     no change │
│ QQuery 11 │        82.23 / 84.94 ±4.30 / 93.50 ms │        80.67 / 84.22 ±2.40 / 88.21 ms │     no change │
│ QQuery 12 │     268.15 / 275.01 ±5.12 / 280.92 ms │     272.40 / 277.06 ±4.51 / 284.84 ms │     no change │
│ QQuery 13 │    368.44 / 386.64 ±15.50 / 407.35 ms │    365.72 / 380.17 ±11.85 / 394.45 ms │     no change │
│ QQuery 14 │     284.08 / 288.14 ±3.99 / 294.29 ms │     283.95 / 289.28 ±5.13 / 296.08 ms │     no change │
│ QQuery 15 │     270.76 / 281.64 ±8.50 / 293.21 ms │     273.57 / 282.06 ±5.47 / 290.40 ms │     no change │
│ QQuery 16 │    613.97 / 633.30 ±14.55 / 655.44 ms │    633.54 / 650.56 ±15.78 / 678.54 ms │     no change │
│ QQuery 17 │    627.57 / 642.36 ±15.38 / 672.10 ms │     633.27 / 638.88 ±5.81 / 647.16 ms │     no change │
│ QQuery 18 │ 1286.56 / 1298.98 ±17.57 / 1333.29 ms │ 1248.48 / 1308.50 ±52.34 / 1405.38 ms │     no change │
│ QQuery 19 │        27.83 / 28.50 ±0.48 / 29.16 ms │        27.71 / 30.58 ±4.74 / 40.04 ms │  1.07x slower │
│ QQuery 20 │    520.62 / 528.79 ±10.06 / 548.20 ms │    515.63 / 544.83 ±41.82 / 627.94 ms │     no change │
│ QQuery 21 │     597.20 / 600.28 ±2.46 / 603.86 ms │     596.67 / 603.40 ±8.35 / 619.49 ms │     no change │
│ QQuery 22 │ 1068.45 / 1082.17 ±11.96 / 1099.67 ms │ 1068.38 / 1078.88 ±10.23 / 1098.45 ms │     no change │
│ QQuery 23 │ 3204.98 / 3239.96 ±25.33 / 3275.62 ms │ 3225.26 / 3249.92 ±16.13 / 3274.56 ms │     no change │
│ QQuery 24 │        41.87 / 46.42 ±4.67 / 52.54 ms │        41.46 / 43.56 ±2.71 / 48.60 ms │ +1.07x faster │
│ QQuery 25 │     112.05 / 117.80 ±8.20 / 133.70 ms │     112.06 / 113.66 ±1.75 / 116.96 ms │     no change │
│ QQuery 26 │        42.08 / 43.69 ±2.18 / 47.97 ms │        42.10 / 43.36 ±1.27 / 45.40 ms │     no change │
│ QQuery 27 │     671.63 / 680.57 ±7.53 / 691.76 ms │     672.26 / 678.59 ±6.08 / 688.60 ms │     no change │
│ QQuery 28 │ 3022.68 / 3078.00 ±29.56 / 3107.44 ms │ 3056.17 / 3069.97 ±18.31 / 3105.62 ms │     no change │
│ QQuery 29 │        40.15 / 40.64 ±0.48 / 41.47 ms │        40.63 / 44.10 ±6.10 / 56.25 ms │  1.09x slower │
│ QQuery 30 │    304.75 / 319.07 ±14.91 / 340.83 ms │     301.49 / 307.64 ±4.70 / 314.84 ms │     no change │
│ QQuery 31 │     285.56 / 297.27 ±8.15 / 307.50 ms │     291.93 / 299.39 ±6.45 / 310.48 ms │     no change │
│ QQuery 32 │   941.65 / 983.57 ±26.84 / 1025.22 ms │   957.18 / 980.96 ±24.72 / 1020.41 ms │     no change │
│ QQuery 33 │  1489.03 / 1496.63 ±6.61 / 1507.09 ms │ 1490.78 / 1524.38 ±33.42 / 1578.29 ms │     no change │
│ QQuery 34 │ 1481.03 / 1511.74 ±23.60 / 1544.94 ms │ 1502.03 / 1529.85 ±18.33 / 1547.78 ms │     no change │
│ QQuery 35 │    282.21 / 310.20 ±41.19 / 392.08 ms │    280.58 / 302.49 ±21.35 / 333.66 ms │     no change │
│ QQuery 36 │        66.46 / 78.28 ±8.81 / 88.69 ms │        69.97 / 73.53 ±2.52 / 77.63 ms │ +1.06x faster │
│ QQuery 37 │        35.51 / 42.75 ±8.94 / 58.80 ms │        37.03 / 44.09 ±8.32 / 58.20 ms │     no change │
│ QQuery 38 │        41.20 / 46.42 ±5.24 / 56.11 ms │        43.07 / 43.93 ±0.97 / 45.41 ms │ +1.06x faster │
│ QQuery 39 │     154.92 / 156.59 ±1.82 / 159.88 ms │    131.67 / 154.07 ±13.24 / 173.15 ms │     no change │
│ QQuery 40 │        13.75 / 14.58 ±1.10 / 16.76 ms │        16.31 / 16.83 ±0.52 / 17.76 ms │  1.15x slower │
│ QQuery 41 │        13.37 / 14.24 ±1.34 / 16.90 ms │        15.84 / 16.31 ±0.57 / 17.36 ms │  1.15x slower │
│ QQuery 42 │        12.92 / 13.22 ±0.19 / 13.51 ms │        13.65 / 15.92 ±4.41 / 24.74 ms │  1.20x slower │
└───────────┴───────────────────────────────────────┴───────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 20137.14ms │
│ Total Time (lift-selectivity-stats)   │ 20204.65ms │
│ Average Time (HEAD)                   │   468.31ms │
│ Average Time (lift-selectivity-stats) │   469.88ms │
│ Queries Faster                        │          3 │
│ Queries Slower                        │          5 │
│ Queries with No Change                │         35 │
│ Queries with Failure                  │          0 │
└───────────────────────────────────────┴────────────┘

Resource Usage

clickbench_partitioned — base (merge-base)

Metric Value
Wall time 105.0s
Peak memory 29.9 GiB
Avg memory 23.1 GiB
CPU user 1034.5s
CPU sys 75.8s
Peak spill 0 B

clickbench_partitioned — branch

Metric Value
Wall time 105.0s
Peak memory 30.1 GiB
Avg memory 22.9 GiB
CPU user 1034.7s
CPU sys 78.0s
Peak spill 0 B

File an issue against this benchmark runner

@adriangb

adriangb commented Jun 1, 2026

Copy link
Copy Markdown
Contributor Author

run benchmarks

env:
  DATAFUSION_EXECUTION_ADAPTIVE_FILTER_REORDERING: true

@adriangbot

Copy link
Copy Markdown

Benchmark for this request failed.

Last 20 lines of output:

Click to expand
Cloning into '/workspace/datafusion-branch'...
From https://github.com/apache/datafusion
 * [new ref]         refs/pull/22698/head -> lift-selectivity-stats
 * branch            main                 -> FETCH_HEAD
Switched to branch 'lift-selectivity-stats'
85bc5ef7473d608604dc2e8bd81184505a1f6c19
Cloning into '/workspace/datafusion-base'...
HEAD is now at 85bc5ef fix: Projection stats Absent for columns referenced >1 time (#22679)
rustc 1.95.0 (59807616e 2026-04-14)
5e71ea4bd7436e797944a53367c326ab4088dc93
85bc5ef7473d608604dc2e8bd81184505a1f6c19

File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4593706269-396-l9mwp 6.12.68+ #1 SMP Wed Apr 1 02:23:28 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (5e71ea4) to 85bc5ef (merge-base) diff using: tpcds
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

Benchmark for this request failed.

Last 20 lines of output:

Click to expand
Cloning into '/workspace/datafusion-branch'...
From https://github.com/apache/datafusion
 * [new ref]         refs/pull/22698/head -> lift-selectivity-stats
 * branch            main                 -> FETCH_HEAD
Switched to branch 'lift-selectivity-stats'
85bc5ef7473d608604dc2e8bd81184505a1f6c19
Cloning into '/workspace/datafusion-base'...
HEAD is now at 85bc5ef fix: Projection stats Absent for columns referenced >1 time (#22679)
rustc 1.95.0 (59807616e 2026-04-14)
5e71ea4bd7436e797944a53367c326ab4088dc93
85bc5ef7473d608604dc2e8bd81184505a1f6c19

File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark tpcds_sf1.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                                  HEAD ┃                lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1  │           5.82 / 6.31 ±0.87 / 8.05 ms │           5.93 / 6.42 ±0.80 / 8.01 ms │     no change │
│ QQuery 2  │        79.80 / 80.04 ±0.27 / 80.37 ms │        81.53 / 82.47 ±0.85 / 84.05 ms │     no change │
│ QQuery 3  │        28.89 / 29.05 ±0.14 / 29.27 ms │        29.20 / 29.56 ±0.24 / 29.82 ms │     no change │
│ QQuery 4  │     494.27 / 500.08 ±3.79 / 505.86 ms │     499.47 / 505.08 ±4.91 / 511.99 ms │     no change │
│ QQuery 5  │        51.25 / 51.62 ±0.44 / 52.47 ms │        51.44 / 52.04 ±0.34 / 52.40 ms │     no change │
│ QQuery 6  │        36.17 / 36.79 ±0.31 / 37.03 ms │        36.64 / 36.96 ±0.17 / 37.12 ms │     no change │
│ QQuery 7  │        95.33 / 95.90 ±0.50 / 96.62 ms │       96.38 / 98.83 ±2.80 / 104.28 ms │     no change │
│ QQuery 8  │        37.33 / 39.03 ±3.06 / 45.14 ms │        37.17 / 38.18 ±0.62 / 38.81 ms │     no change │
│ QQuery 9  │        54.23 / 55.76 ±1.33 / 57.97 ms │        52.93 / 57.22 ±2.52 / 60.63 ms │     no change │
│ QQuery 10 │        69.45 / 69.74 ±0.30 / 70.20 ms │        68.89 / 69.43 ±0.40 / 69.86 ms │     no change │
│ QQuery 11 │     311.56 / 315.99 ±3.44 / 320.23 ms │     307.05 / 309.09 ±1.23 / 310.26 ms │     no change │
│ QQuery 12 │        28.94 / 29.22 ±0.28 / 29.68 ms │        29.03 / 29.41 ±0.26 / 29.82 ms │     no change │
│ QQuery 13 │     119.05 / 120.71 ±1.59 / 123.35 ms │     120.30 / 121.32 ±0.60 / 122.00 ms │     no change │
│ QQuery 14 │     504.86 / 508.49 ±3.00 / 513.68 ms │     506.04 / 509.97 ±2.13 / 512.31 ms │     no change │
│ QQuery 15 │        58.80 / 60.48 ±1.32 / 61.94 ms │        59.47 / 62.60 ±2.62 / 66.62 ms │     no change │
│ QQuery 16 │          7.02 / 8.73 ±3.15 / 15.02 ms │           7.14 / 7.31 ±0.23 / 7.75 ms │ +1.19x faster │
│ QQuery 17 │        81.33 / 83.02 ±1.66 / 85.85 ms │        81.70 / 83.09 ±1.15 / 84.59 ms │     no change │
│ QQuery 18 │     126.69 / 127.94 ±1.71 / 131.24 ms │     129.83 / 131.09 ±1.00 / 132.43 ms │     no change │
│ QQuery 19 │        42.26 / 42.62 ±0.36 / 43.27 ms │        42.77 / 42.99 ±0.19 / 43.28 ms │     no change │
│ QQuery 20 │        36.09 / 36.34 ±0.22 / 36.64 ms │        35.85 / 36.54 ±0.56 / 37.21 ms │     no change │
│ QQuery 21 │        17.49 / 17.98 ±0.64 / 19.18 ms │        17.43 / 17.59 ±0.18 / 17.89 ms │     no change │
│ QQuery 22 │        64.51 / 64.98 ±0.58 / 66.09 ms │        62.65 / 64.23 ±0.95 / 65.31 ms │     no change │
│ QQuery 23 │     349.52 / 356.04 ±6.30 / 363.90 ms │     356.04 / 361.58 ±4.41 / 367.32 ms │     no change │
│ QQuery 24 │     226.34 / 230.94 ±6.39 / 243.55 ms │     227.31 / 232.72 ±7.61 / 247.68 ms │     no change │
│ QQuery 25 │     113.98 / 117.00 ±2.18 / 120.36 ms │     115.56 / 118.61 ±3.02 / 123.10 ms │     no change │
│ QQuery 26 │        58.71 / 59.17 ±0.56 / 60.20 ms │        60.79 / 61.26 ±0.38 / 61.73 ms │     no change │
│ QQuery 27 │           6.91 / 7.07 ±0.16 / 7.36 ms │           6.78 / 6.96 ±0.18 / 7.31 ms │     no change │
│ QQuery 28 │        60.56 / 62.92 ±4.24 / 71.40 ms │        60.21 / 65.07 ±4.80 / 74.30 ms │     no change │
│ QQuery 29 │      99.32 / 101.20 ±2.39 / 105.90 ms │     100.11 / 101.08 ±1.06 / 103.11 ms │     no change │
│ QQuery 30 │        33.17 / 33.39 ±0.17 / 33.67 ms │        34.02 / 34.71 ±0.43 / 35.35 ms │     no change │
│ QQuery 31 │     112.34 / 114.83 ±1.63 / 116.74 ms │     115.67 / 116.64 ±0.98 / 118.40 ms │     no change │
│ QQuery 32 │        21.55 / 21.65 ±0.06 / 21.74 ms │        21.24 / 21.57 ±0.34 / 22.14 ms │     no change │
│ QQuery 33 │        39.14 / 39.79 ±0.48 / 40.39 ms │        40.28 / 40.67 ±0.33 / 41.27 ms │     no change │
│ QQuery 34 │          9.74 / 9.89 ±0.12 / 10.07 ms │        10.01 / 10.21 ±0.23 / 10.64 ms │     no change │
│ QQuery 35 │        78.15 / 80.76 ±3.16 / 86.98 ms │        80.19 / 82.53 ±3.12 / 88.66 ms │     no change │
│ QQuery 36 │          6.09 / 7.66 ±3.00 / 13.67 ms │           6.46 / 6.57 ±0.10 / 6.73 ms │ +1.17x faster │
│ QQuery 37 │           7.03 / 7.26 ±0.16 / 7.48 ms │           7.40 / 7.60 ±0.18 / 7.81 ms │     no change │
│ QQuery 38 │        65.03 / 66.15 ±1.31 / 68.57 ms │        67.19 / 68.33 ±0.75 / 69.07 ms │     no change │
│ QQuery 39 │     454.35 / 467.85 ±8.59 / 478.34 ms │     470.33 / 479.43 ±6.34 / 489.37 ms │     no change │
│ QQuery 40 │        23.29 / 23.39 ±0.06 / 23.47 ms │        23.94 / 26.98 ±3.63 / 33.79 ms │  1.15x slower │
│ QQuery 41 │        11.39 / 11.56 ±0.32 / 12.20 ms │        11.75 / 11.85 ±0.15 / 12.14 ms │     no change │
│ QQuery 42 │        23.56 / 27.69 ±5.04 / 37.21 ms │        24.58 / 24.86 ±0.31 / 25.36 ms │ +1.11x faster │
│ QQuery 43 │           4.98 / 5.14 ±0.20 / 5.54 ms │           5.01 / 5.13 ±0.13 / 5.39 ms │     no change │
│ QQuery 44 │        10.85 / 11.19 ±0.46 / 12.02 ms │        10.92 / 11.13 ±0.19 / 11.45 ms │     no change │
│ QQuery 45 │        38.85 / 39.30 ±0.38 / 39.89 ms │        40.27 / 41.19 ±0.87 / 42.68 ms │     no change │
│ QQuery 46 │        12.15 / 12.57 ±0.33 / 12.98 ms │        11.91 / 12.08 ±0.11 / 12.19 ms │     no change │
│ QQuery 47 │     229.34 / 235.11 ±3.91 / 239.39 ms │     234.76 / 243.73 ±6.84 / 254.08 ms │     no change │
│ QQuery 48 │        96.71 / 97.54 ±0.62 / 98.34 ms │     100.33 / 101.97 ±2.50 / 106.95 ms │     no change │
│ QQuery 49 │        78.13 / 79.23 ±1.06 / 81.24 ms │        78.87 / 79.20 ±0.49 / 80.16 ms │     no change │
│ QQuery 50 │        60.04 / 60.40 ±0.24 / 60.72 ms │        60.33 / 63.91 ±4.26 / 70.68 ms │  1.06x slower │
│ QQuery 51 │       92.39 / 95.01 ±3.43 / 101.61 ms │       95.31 / 98.34 ±2.76 / 103.39 ms │     no change │
│ QQuery 52 │        24.14 / 24.38 ±0.27 / 24.89 ms │        24.52 / 24.76 ±0.21 / 25.05 ms │     no change │
│ QQuery 53 │        29.87 / 30.10 ±0.24 / 30.54 ms │        29.95 / 30.28 ±0.20 / 30.51 ms │     no change │
│ QQuery 54 │        55.58 / 56.31 ±0.40 / 56.78 ms │        56.18 / 58.88 ±4.03 / 66.84 ms │     no change │
│ QQuery 55 │        23.56 / 23.78 ±0.14 / 23.96 ms │        23.62 / 24.06 ±0.47 / 24.79 ms │     no change │
│ QQuery 56 │        39.09 / 39.96 ±0.57 / 40.74 ms │        39.20 / 39.69 ±0.42 / 40.32 ms │     no change │
│ QQuery 57 │     178.78 / 179.25 ±0.69 / 180.58 ms │     178.30 / 179.73 ±0.79 / 180.74 ms │     no change │
│ QQuery 58 │     116.42 / 119.18 ±3.73 / 126.42 ms │     117.96 / 118.94 ±0.63 / 119.54 ms │     no change │
│ QQuery 59 │     117.65 / 121.68 ±4.25 / 128.99 ms │     118.35 / 119.99 ±1.66 / 123.15 ms │     no change │
│ QQuery 60 │        39.25 / 40.39 ±0.64 / 40.98 ms │        39.44 / 40.25 ±0.75 / 41.65 ms │     no change │
│ QQuery 61 │        13.27 / 13.51 ±0.27 / 13.84 ms │        13.28 / 13.39 ±0.12 / 13.61 ms │     no change │
│ QQuery 62 │        46.11 / 46.51 ±0.28 / 46.88 ms │        46.89 / 48.45 ±2.82 / 54.09 ms │     no change │
│ QQuery 63 │        29.49 / 29.94 ±0.25 / 30.22 ms │        30.66 / 31.53 ±1.19 / 33.90 ms │  1.05x slower │
│ QQuery 64 │     393.38 / 398.72 ±4.32 / 406.62 ms │    401.67 / 417.84 ±16.92 / 448.06 ms │     no change │
│ QQuery 65 │     146.82 / 153.16 ±6.38 / 165.09 ms │     143.93 / 149.63 ±3.21 / 152.89 ms │     no change │
│ QQuery 66 │        78.86 / 84.01 ±7.83 / 99.53 ms │        81.31 / 84.52 ±5.30 / 95.08 ms │     no change │
│ QQuery 67 │     253.34 / 260.87 ±5.55 / 267.80 ms │     257.07 / 263.58 ±3.36 / 266.43 ms │     no change │
│ QQuery 68 │        12.06 / 12.25 ±0.13 / 12.44 ms │        12.22 / 12.33 ±0.10 / 12.50 ms │     no change │
│ QQuery 69 │        62.75 / 63.32 ±0.40 / 63.92 ms │        63.38 / 68.00 ±6.18 / 80.12 ms │  1.07x slower │
│ QQuery 70 │    106.19 / 114.35 ±11.83 / 137.05 ms │     107.84 / 110.28 ±2.94 / 114.43 ms │     no change │
│ QQuery 71 │        35.35 / 35.95 ±0.40 / 36.53 ms │        35.84 / 36.39 ±0.54 / 37.23 ms │     no change │
│ QQuery 72 │ 2058.69 / 2175.08 ±63.62 / 2232.46 ms │ 2139.15 / 2213.49 ±38.81 / 2253.11 ms │     no change │
│ QQuery 73 │           9.38 / 9.65 ±0.20 / 9.92 ms │          9.58 / 9.72 ±0.16 / 10.04 ms │     no change │
│ QQuery 74 │     173.14 / 177.60 ±2.68 / 180.90 ms │     173.85 / 181.21 ±6.16 / 191.89 ms │     no change │
│ QQuery 75 │     147.58 / 148.96 ±1.12 / 150.46 ms │     149.57 / 155.16 ±8.99 / 173.09 ms │     no change │
│ QQuery 76 │        35.11 / 37.01 ±2.12 / 41.08 ms │        35.41 / 35.80 ±0.29 / 36.28 ms │     no change │
│ QQuery 77 │        62.08 / 62.53 ±0.40 / 63.26 ms │        61.47 / 63.84 ±3.81 / 71.40 ms │     no change │
│ QQuery 78 │     191.12 / 194.23 ±2.22 / 197.94 ms │     193.92 / 196.85 ±2.08 / 200.39 ms │     no change │
│ QQuery 79 │        67.68 / 68.04 ±0.33 / 68.67 ms │        69.14 / 71.50 ±3.49 / 78.43 ms │  1.05x slower │
│ QQuery 80 │     101.69 / 107.53 ±9.15 / 125.68 ms │     101.73 / 103.23 ±0.92 / 104.21 ms │     no change │
│ QQuery 81 │        26.06 / 26.56 ±0.33 / 26.93 ms │        26.35 / 31.36 ±4.56 / 37.67 ms │  1.18x slower │
│ QQuery 82 │        16.56 / 16.64 ±0.08 / 16.80 ms │        17.59 / 17.89 ±0.32 / 18.46 ms │  1.08x slower │
│ QQuery 83 │        40.26 / 40.60 ±0.27 / 40.92 ms │        41.39 / 42.25 ±0.65 / 43.31 ms │     no change │
│ QQuery 84 │        34.75 / 34.84 ±0.08 / 34.97 ms │        35.42 / 35.79 ±0.30 / 36.20 ms │     no change │
│ QQuery 85 │     110.06 / 115.53 ±6.53 / 126.32 ms │     109.80 / 113.38 ±3.46 / 117.74 ms │     no change │
│ QQuery 86 │        25.19 / 25.71 ±0.38 / 26.09 ms │        26.44 / 26.86 ±0.36 / 27.53 ms │     no change │
│ QQuery 87 │        63.81 / 64.48 ±0.49 / 65.16 ms │        66.81 / 67.51 ±0.76 / 68.88 ms │     no change │
│ QQuery 88 │        62.87 / 66.02 ±3.11 / 70.95 ms │        62.92 / 63.20 ±0.26 / 63.61 ms │     no change │
│ QQuery 89 │        36.16 / 36.61 ±0.49 / 37.51 ms │        36.68 / 40.88 ±5.24 / 49.13 ms │  1.12x slower │
│ QQuery 90 │        16.84 / 17.10 ±0.25 / 17.50 ms │        17.26 / 17.55 ±0.21 / 17.89 ms │     no change │
│ QQuery 91 │        45.57 / 45.98 ±0.46 / 46.87 ms │        46.52 / 47.12 ±0.41 / 47.81 ms │     no change │
│ QQuery 92 │        29.48 / 29.98 ±0.31 / 30.39 ms │        31.24 / 31.55 ±0.22 / 31.86 ms │  1.05x slower │
│ QQuery 93 │        50.27 / 53.53 ±2.12 / 56.34 ms │        51.45 / 55.30 ±4.03 / 63.04 ms │     no change │
│ QQuery 94 │        38.91 / 39.41 ±0.50 / 40.33 ms │        39.63 / 42.75 ±3.23 / 47.88 ms │  1.08x slower │
│ QQuery 95 │        83.68 / 84.50 ±0.77 / 85.72 ms │        86.00 / 87.49 ±1.14 / 89.04 ms │     no change │
│ QQuery 96 │        23.85 / 24.12 ±0.23 / 24.48 ms │        24.47 / 24.83 ±0.29 / 25.31 ms │     no change │
│ QQuery 97 │        45.96 / 48.57 ±3.49 / 55.41 ms │        46.58 / 47.08 ±0.44 / 47.86 ms │     no change │
│ QQuery 98 │        42.43 / 43.26 ±0.66 / 44.26 ms │        43.69 / 44.39 ±0.36 / 44.70 ms │     no change │
│ QQuery 99 │        69.82 / 70.11 ±0.23 / 70.49 ms │        70.83 / 71.35 ±0.44 / 71.94 ms │     no change │
└───────────┴───────────────────────────────────────┴───────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 10574.30ms │
│ Total Time (lift-selectivity-stats)   │ 10739.18ms │
│ Average Time (HEAD)                   │   106.81ms │
│ Average Time (lift-selectivity-stats) │   108.48ms │
│ Queries Faster                        │          3 │
│ Queries Slower                        │         10 │
│ Queries with No Change                │         86 │
│ Queries with Failure                  │          0 │
└───────────────────────────────────────┴────────────┘

Resource Usage

tpcds — base (merge-base)

Metric Value
Wall time 55.0s
Peak memory 6.7 GiB
Avg memory 6.1 GiB
CPU user 239.0s
CPU sys 6.4s
Peak spill 0 B

tpcds — branch

Metric Value
Wall time 55.0s
Peak memory 6.9 GiB
Avg memory 6.3 GiB
CPU user 243.1s
CPU sys 6.9s
Peak spill 0 B

File an issue against this benchmark runner

@adriangb adriangb force-pushed the lift-selectivity-stats branch 3 times, most recently from a24471d to 4d7b733 Compare June 2, 2026 02:29
@adriangb

adriangb commented Jun 2, 2026

Copy link
Copy Markdown
Contributor Author

run benchmark adversarial_filter

baseline:
  ref: lift-selectivity-stats
  env:
    DATAFUSION_EXECUTION_ADAPTIVE_FILTER_REORDERING: false
changed:
  ref: lift-selectivity-stats
  env:
    DATAFUSION_EXECUTION_ADAPTIVE_FILTER_REORDERING: true

@adriangbot

Copy link
Copy Markdown

Benchmark for this request failed.

Last 20 lines of output:

Click to expand
Cloning into '/workspace/datafusion-branch'...
From https://github.com/apache/datafusion
 * [new ref]         refs/pull/22698/head -> lift-selectivity-stats
 * branch            main                 -> FETCH_HEAD
Switched to branch 'lift-selectivity-stats'
85bc5ef7473d608604dc2e8bd81184505a1f6c19
From https://github.com/apache/datafusion
 * branch            refs/pull/22698/head -> FETCH_HEAD
Already on 'lift-selectivity-stats'
Cloning into '/workspace/datafusion-base'...
From https://github.com/apache/datafusion
 * branch            refs/pull/22698/head -> FETCH_HEAD
error: pathspec 'lift-selectivity-stats' did not match any file(s) known to git

File an issue against this benchmark runner

@adriangb adriangb force-pushed the lift-selectivity-stats branch from 4d7b733 to a622e10 Compare June 4, 2026 16:27
pull Bot pushed a commit to buraksenn/datafusion that referenced this pull request Jun 9, 2026
…ilter evaluation (apache#22704)

## Which issue does this PR close?

<!-- No tracking issue; this is a standalone benchmark contribution. -->

This PR does not close an issue. It adds a benchmark suite to support
work and
discussion around predicate ordering in filter evaluation (e.g. the
static
reordering in apache#22343 and the runtime/statistics-based reordering
explored in
apache#22698). It deliberately benchmarks *no specific implementation* — see
below.

## Rationale for this change

Conjunctive (`AND`) filter evaluation in `FilterExec` is a left-deep
`BinaryExpr(And)` chain, and the order conjuncts are evaluated in can
change
runtime by large factors: once a leading conjunct passes few enough rows
the
batch is physically compacted before the rest, so a cheap-and-selective
predicate evaluated early saves later predicates work. Predicate
ordering is
therefore an active area (static heuristics, runtime/adaptive schemes,
cost
models).

There is currently no benchmark suite that isolates the dimensions that
drive
this. Existing macro-benchmarks (TPC-H/DS, ClickBench) only incidentally
exercise filter ordering, so they can't show *why* a change to ordering
helped
or hurt, or guard the order-insensitive case against regressions.

## What changes are included in this PR?

A new SQL benchmark suite, `benchmarks/sql_benchmarks/predicate_eval`,
built on
the existing `.benchmark` template framework (no engine code, no new
Rust). It
sets no engine config of its own and measures DataFusion's built-in
short-circuit
by default; a system under test is toggled purely via its native
`DATAFUSION_EXECUTION_*` env var (the bench harness builds its
`SessionContext`
with `SessionConfig::from_env`), so the same scenarios can characterise
the
baseline, a static heuristic, an adaptive scheme, or a cost model and be
compared apples-to-apples.

It is organised into 10 subgroups (select with `BENCH_SUBGROUP`), each
varying
one property of conjunctive filter evaluation while holding the others
fixed:

| Subgroup | What it varies (others held fixed) |
|---|---|
| `costsel` | cost and selectivity point in different directions
(expensive predicate is the selective one) |
| `cost` | per-predicate cost, at equal selectivity |
| `selectivity` | per-predicate selectivity, at equal cost |
| `cardinality` | conjunct count `k = 2/4/8/16` |
| `width` | string-column width (`PRED_FILL` = 2 / 30 / 170 chars) |
| `scale` | row count `5k / 100k / 5M / 50M` |
| `neutral` | predicates are interchangeable (equal cost, none
selective) — an order-insensitive control |
| `correlation` | conditional vs marginal selectivity (independent /
positively / anti-correlated) |
| `drift` | selectivity that changes across the scan |
| `nulls` | null density (two- vs three-valued predicate results) |

Each query's comment notes the per-predicate cost/selectivity that the
data
generation hides from the SQL. Data is synthetic and generated inline by
each
subgroup's load SQL (no external files); `PRED_ROWS` sizes it and
`PRED_FILL`
sets string width. Wired into `bench.sh` (`./bench.sh run
predicate_eval`) and
documented in `benchmarks/sql_benchmarks/README.md`.

The design was informed by surveying how Velox drives the analogous
decision
(it ranks by cycles-per-row-eliminated, `time / (rows_in - rows_out)`).

> Note: the `scale` subgroup's `q52`/`q53` build 5M / 50M-row tables
(the latter
> ~9 GB); run a single point with `BENCH_QUERY` if that is too heavy.

## Are these changes tested?

These are benchmark definitions, not engine code. Each `.benchmark`
includes an
`assert` that the generated table is non-empty, and every subgroup was
run
locally at small `PRED_ROWS` to confirm the suite parses, loads,
asserts, and
executes end-to-end. The queries are order-invariant (`SELECT count(*)
...`), so
any predicate-ordering system can also be checked for correctness by
diffing
counts with the optimization on vs. off.

## Are there any user-facing changes?

No. This only adds an opt-in benchmark suite and its documentation; no
public
API, engine behavior, or default configuration changes.

---------

Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
adriangb and others added 4 commits June 8, 2026 21:43
Add a shared, policy-free substrate for runtime-adaptive filtering under
`adaptive`:

- `SelectivityStats`: per-predicate online accumulator of selectivity
  (pass rate), cost (eval nanos), and a caller-supplied effectiveness
  sample with Welford mean/variance and one-sided confidence bounds.
- `AdaptiveStatsRegistry`: concurrent `FilterId -> stats` map with
  per-predicate skip flags, for a shared/multi-threaded consumer.

The kernel defines no placement or ordering policy — consumers (an
adaptive `FilterExec`, later the parquet scan) layer their own ranking
function on top. `FilterId` is registry-local; there is no global id.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Experimental, off by default. Gates runtime-adaptive reordering of the
conjuncts of a conjunctive `FilterExec` predicate. Regenerate configs.md
and the information_schema config listing.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
When `execution.adaptive_filter_reordering` is on and the predicate is a
multi-conjunct `AND` with no volatile expressions, FilterExec evaluates the
conjuncts in a measured order instead of as a single fused predicate.

- Conjuncts are evaluated sequentially with threshold-gated compaction
  (mirroring BinaryExpr's pre-selection), measuring each conjunct's marginal
  selectivity and cost per batch via stream-local `Vec<SelectivityStats>`
  (ids are dense 0..n, no locking).
- Conjuncts are ranked by mean discards-per-second (= minimising
  cost_per_row / (1 - pass_rate)); the order is committed once it is
  statistically certain (adjacent effectiveness confidence intervals do not
  overlap), or after a small sample cap if they are indistinguishable.
- On freeze the conjuncts are fused into a left-deep AND in the learned order
  and evaluated as an ordinary predicate, so the steady state pays no adaptive
  overhead and reuses BinaryExpr's pre-selection. A frozen evaluator
  periodically re-thaws to detect distribution drift, backing the interval off
  exponentially while the order is stable.

State is stream-local; the plan, results, and EXPLAIN are unchanged.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
adaptive_filter.slt: results and EXPLAIN are identical with the flag on
and off (reordering changes evaluation order only).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@adriangb adriangb force-pushed the lift-selectivity-stats branch from a622e10 to 24cf5c3 Compare June 9, 2026 02:45
@adriangb

adriangb commented Jun 9, 2026

Copy link
Copy Markdown
Contributor Author

run benchmark predicate_eval

@adriangbot

Copy link
Copy Markdown

🤖 Criterion benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4659604910-502-wj4fm 6.12.68+ #1 SMP Sat May 2 07:49:07 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (24cf5c3) to 0f8a121 (merge-base) diff
BENCH_NAME=predicate_eval
BENCH_COMMAND=cargo bench --features=parquet --bench predicate_eval
BENCH_FILTER=
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

Benchmark for this request failed.

Last 20 lines of output:

Click to expand
    substr
    substr_index
    substring
    sum
    to_char
    to_hex
    to_local_time
    to_time
    to_timestamp
    topk_aggregate
    topk_repartition
    translate
    trim
    trunc
    unhex
    unions_to_filter
    upper
    uuid
    window_query_sql
    with_hashes

File an issue against this benchmark runner

@adriangb

adriangb commented Jun 9, 2026

Copy link
Copy Markdown
Contributor Author

run benchmark predicate_eval

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4659749902-504-m4xmf 6.12.68+ #1 SMP Sat May 2 07:49:07 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (24cf5c3) to 0f8a121 (merge-base) diff using: predicate_eval
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and lift-selectivity-stats
Note: Skipping /workspace/datafusion-bench/benchmarks/results/HEAD/*.json as /workspace/datafusion-bench/benchmarks/results/lift-selectivity-stats/*.json does not exist

Resource Usage

predicate_eval — base (merge-base)

Metric Value
Wall time 970.2s
Peak memory 61.5 GiB
Avg memory 18.3 GiB
CPU user 3490.6s
CPU sys 77.5s
Peak spill 0 B

predicate_eval — branch

Metric Value
Wall time 1075.2s
Peak memory 63.1 GiB
Avg memory 18.2 GiB
CPU user 3834.5s
CPU sys 84.4s
Peak spill 0 B

File an issue against this benchmark runner

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

auto detected api change Auto detected API change common Related to common crate documentation Improvements or additions to documentation physical-expr Changes to the physical-expr crates physical-plan Changes to the physical-plan crate sqllogictest SQL Logic Tests (.slt)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants